Abstract

Cascade models humans learning college physics by studying examples
and solving problems. It simulates the main qualitative phenomena
visible in human protocols of learning, including several strategies
for analogical and non-analogical problem solving, and two strategies
for studying examples. It learns at the knowledge level by acquiring
new physics rules, and it learns search control knowledge. Most
importantly, it models a recently observed phenomenon, the
self-explanation effect, which correlates students’ example studying
strategies with the amount they learn.

INTRODUCTION 

The long term goal of the Cascade project is to develop a model of
human cognitive skill acquisition in scientific, mathematical and
engineering task domains. The goal for the present version of the
system, Cascade 3, is to model the data from a study by Chi, Bassok,
Lewis, Reimann and Glaser (1989), who took protocols as students
studied a chapter on Newton’s laws from a standard college physics
textbook. The chapter consisted of three distinct types of material,
which were presented in the following sequence: text, which defines
concepts, explains principles and presents ancillary information;
examples, which pose a problem and work out its solution; and
problems, which the student solves without feedback or help. Chi et
al. discovered that students who learned more also explained the
examples to themselves more thoroughly. The best learners checked
almost every line of the example solution to see if they could
generate it themselves. In particular, Chi et al. found that the
number of self-explanation utterances in a protocol was strongly
correlated (r=.81) with the student’s score on the problems. Since all
subjects scored roughly the same when tested just prior to studying
the examples, this shows that better learning is correlated with more
self-explanation of the examples. This phenomenon, called the
self-explanation effect, has been replicated twice in other task
domains (Pirolli & Bielaczyc, 1989; Ferguson-Hessler & de Jong, 1990).

Cascade 3 is a machine learning program that models the
self-explanation effect. This paper discusses its design and
evaluation. (See VanLehn & Jones, in press, for a complete discussion
of its design, and VanLehn, Jones & Chi, in press, for a more complete
discussion of the evaluation.) There were two kinds of constraints on
the design. First, the simulation must display the same overt
behaviors as the subjects. For instance, subjects often refer to the
examples while solving problems, and rarely refer to the text. This
should be true of Cascade’s performance as well. Second, the
simulation should learn what the subjects learn. The first few
sections of this paper list the overt behaviors and the types of
learning that constrain Cascade’s design, and describe how Cascade
models each. The last section discusses its evaluation: does it
actually reproduce the self-explanation effect?

ORDINARY  PROBLEM  SOLVING 

During the later part of the Chi et al. study, students solve physics
problems, such as “A block of mass m is kept at rest on a smooth
plane, inclined at an angle of 30 degrees with the horizontal, by
means of a string attached to a vertical wall. What is the tension in
the string?” A common mode of work is ordinary problem solving,
wherein the student does not refer to the examples or the text.

Cascade models this overt behavior by using a Prolog meta-interpreter
to prove propositions such as

value(tension(string1),X)

where tension(string1) is a sought quantity mentioned in the problem
statement. The proof uses equations such as

tension(S) = magnitude(force(B,S,tension)),

which says that the tension in a string S is equal to the magnitude of
the tension force exerted on a body B by the string. An equation has
conditions that indicate when it applies (e.g., that the string is
tied to the body). The combination of an equation and its condition is
called a Cascade rule. The meta-interpreter has algebraic knowledge
built into it, so a single Cascade rule can be used to infer a value
for any of the equation’s quantities from the other quantities’
values. Most of Cascade’s knowledge is represented with Cascade rules.
Prolog rules are used for visual inference and other kinds of
uninteresting, non-learned knowledge.
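As a rough illustration of how one rule can serve several queries, the following Python sketch pairs an equation with its condition and solves for whichever side is unknown. It is not Cascade's Prolog meta-interpreter: the class `CascadeRule`, the function `apply_rule`, the string encoding of quantities, and the value 4.9 are all assumptions made for this example, and the only "algebra" handled is an equation of the form lhs = rhs with the variables S and B already bound.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Set, Tuple

Fact = Tuple[str, ...]


@dataclass(frozen=True)
class CascadeRule:
    """An equation `lhs = rhs` plus the condition under which it applies."""
    name: str
    lhs: str          # quantity on the left of the equation
    rhs: str          # quantity on the right of the equation
    condition: Fact   # ground literal that must hold for the rule to apply


def apply_rule(rule: CascadeRule, facts: Set[Fact],
               known: Dict[str, float]) -> Optional[Tuple[str, float]]:
    """Infer the unknown side of the equation from the known side.

    Returns (quantity, value), or None if the condition fails or the
    equation does not have exactly one unknown side.
    """
    if rule.condition not in facts:
        return None
    if rule.rhs in known and rule.lhs not in known:
        return rule.lhs, known[rule.rhs]
    if rule.lhs in known and rule.rhs not in known:
        return rule.rhs, known[rule.lhs]
    return None


# Hypothetical ground instance of the tension equation from the text.
TENSION = "tension(string1)"
FORCE_MAG = "magnitude(force(block1,string1,tension))"
rule = CascadeRule("tension-definition", TENSION, FORCE_MAG,
                   ("tied_to", "string1", "block1"))
facts = {("tied_to", "string1", "block1")}

# The same rule answers a query in either direction.
print(apply_rule(rule, facts, {FORCE_MAG: 4.9}))
print(apply_rule(rule, facts, {TENSION: 4.9}))
```

Keeping the condition separate from the equation mirrors the description above: the condition decides whether the rule applies, and the equation alone decides what can be inferred.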


STUDYING  EXAMPLES 

A physics example consists of a problem, then a list of lines, where a
line can be a force diagram, an equation or a few sentences. For
uniformity, Cascade represents all lines as equations, which requires
inventing some “algebraic” operators for representing diagrams and
sentences.

Two fairly distinctive behaviors are common when students study an
example line. One is self-explanation. Students try to rederive the
line, almost as if they are checking the work of the example’s author.
Students virtually never reflect on the overall solution and try to
recognize a plan that spans all the lines. Their self-explanations are
local to a single line. Cascade models self-explanation by proving the
line. If the search for a proof succeeds, the derivation is stored as
a set of pairs, each consisting of a goal and the Cascade rule used to
achieve that goal.

The second behavior observed during example studying consists of
simply reading the line, and sometimes paraphrasing it. Students
undoubtedly engage in parsing and reference resolution (e.g.,
determining that “Fa” refers to the force exerted by string A on the
block), and these processes also occur as a front end to
self-explanation. Instead of modeling them, Cascade presents lines as
parsed expressions with all referring terms replaced by their
definitions. These data structures model the output of the parsing and
disambiguating processes. Thus, to model this second strategy for
studying a line, Cascade simply stores the line in memory.

ANALOGICAL  PROBLEM  SOLVING 

There are two rather distinct types of analogical problem solving
evident in the protocols. In the first type, students read through an
example, starting at the beginning and stopping when they find a line
they can use for the problem they are working on. Cascade models this
with a kind of transformational analogy (Carbonell, 1986). The givens
of a problem are represented as a set of ground literals, such as

tied-to(string1,b1).

To perform an analogy, Cascade retrieves an example, partially matches
its givens to the givens of the problem, and saves the resulting
mapping. [Some details: Retrieval is not currently modelled in
Cascade. It is simply told which examples go with which problems.
Partial matching enumerates all constant-to-constant mappings, subject
to some type constraints (e.g., constants denoting situations may not
be mapped to constants denoting physical objects). Partial matching
then picks the mapping that maximizes the number of literals that are
put into 1-1 correspondence.] Example lines are represented as
equations. To see if an example line is useful, transformational
analogy applies the saved mapping to it, which yields an equation
expressed in terms of the problem instead of the example. If this
equation matches the current goal, it is used. If not, Cascade moves
on to the next example line.
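The partial matching and line transfer described above can be sketched in Python as follows. This is a brute-force illustration under stated assumptions, not Cascade's matcher: the functions `best_mapping`, `translate`, `apply_mapping` and `mentions`, the tuple encoding of literals and equations, and every constant and type name in the example are hypothetical.

```python
from itertools import product


def translate(literal, mapping):
    """Rewrite an example literal into problem constants (None if unmapped)."""
    pred, *args = literal
    return (pred, *[mapping.get(a) for a in args])


def best_mapping(example_givens, problem_givens, types):
    """Enumerate type-respecting one-to-one constant mappings and keep the
    one that puts the most example literals in correspondence."""
    ex_consts = sorted({c for lit in example_givens for c in lit[1:]})
    pr_consts = sorted({c for lit in problem_givens for c in lit[1:]})
    # Each example constant may map to a same-typed problem constant or stay unmapped.
    options = [[p for p in pr_consts if types[p] == types[e]] + [None]
               for e in ex_consts]
    best, best_score = {}, -1
    for image in product(*options):
        used = [p for p in image if p is not None]
        if len(used) != len(set(used)):
            continue  # two example constants share a target: not 1-1
        mapping = {e: p for e, p in zip(ex_consts, image) if p is not None}
        score = sum(translate(lit, mapping) in problem_givens
                    for lit in example_givens)
        if score > best_score:
            best, best_score = mapping, score
    return best, best_score


def apply_mapping(term, mapping):
    """Re-express an example line (nested tuples) in terms of the problem."""
    if isinstance(term, tuple):
        return tuple(apply_mapping(t, mapping) for t in term)
    return mapping.get(term, term)


def mentions(term, sub):
    """True if `sub` occurs anywhere inside `term`."""
    if term == sub:
        return True
    return isinstance(term, tuple) and any(mentions(t, sub) for t in term)


example_givens = {("tied_to", "stringA", "blockA"),
                  ("rests_on", "blockA", "planeA")}
problem_givens = {("tied_to", "string1", "b1"),
                  ("rests_on", "b1", "plane1"),
                  ("attached_to", "string1", "wall1")}
types = {"stringA": "string", "blockA": "body", "planeA": "surface",
         "string1": "string", "b1": "body", "plane1": "surface",
         "wall1": "surface"}

mapping, score = best_mapping(example_givens, problem_givens, types)
example_line = ("=", ("tension", "stringA"),
                ("magnitude", ("force", "blockA", "stringA", "tension")))
problem_line = apply_mapping(example_line, mapping)
```

Exhaustive enumeration is only workable because the sets of constants here are tiny; the text does not say how Cascade organizes the enumeration, so nothing about its efficiency should be read into this sketch. A transferred line would be used when it mentions the current goal, for example `mentions(problem_line, ("tension", "string1"))`.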

The second type of analogy is much more focussed than the one just
described. At the beginning of a problem solving attempt, students say
something like “This is just like example 6” and open the book to the
appropriate page. Although they probably look at the diagrams, they
generally do not read much of the example. As they work through their
problem, they sometimes refer directly to a line in the example
without reading the preceding lines.

Cascade models this kind of analogy with a mechanism called analogical
search control. At the beginning of a problem, it retrieves an example
and forms a mapping just as in transformational analogy. Whenever
Cascade needs to choose among several rules for achieving a goal, it
uses the map to find an equivalent goal in the example’s derivation,
and if it succeeds, it applies the rule paired with that goal during
self-explanation. If it fails to find such a goal, perhaps because the
example’s lines were not self-explained, then Cascade chooses the
first rule that matches the goal. If this choice fails, it will back
up and try the next rule that matches. Analogical search control is
thus a heuristic for choosing rules. It is also a form of learning,
because every time an example is explained or a problem is solved, its
derivation is saved and thus adds more heuristic power for choosing
rules. Analogical search control is similar to the search control
learning of Eureka (Jones, 1989).
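The rule-choice heuristic just described might look like the following Python sketch. It assumes a derivation stored as a dictionary from example goals to rule names and a constant mapping from example to problem; `order_rules`, `choose_and_apply`, `apply_mapping`, the rule names and the `try_rule` callback are all hypothetical stand-ins, not Cascade's own interfaces.

```python
def apply_mapping(term, mapping):
    """Substitute constants inside a nested-tuple term."""
    if isinstance(term, tuple):
        return tuple(apply_mapping(t, mapping) for t in term)
    return mapping.get(term, term)


def order_rules(goal, candidates, derivation, mapping):
    """Put the rule that achieved the analogous example goal first.

    `mapping` sends example constants to problem constants, so its inverse
    re-expresses the problem goal in the example's terms. With no analogous
    goal in the derivation, the default rule order is kept.
    """
    inverse = {p: e for e, p in mapping.items()}
    example_goal = apply_mapping(goal, inverse)
    preferred = derivation.get(example_goal)
    if preferred in candidates:
        return [preferred] + [r for r in candidates if r != preferred]
    return list(candidates)


def choose_and_apply(goal, candidates, derivation, mapping, try_rule):
    """Try rules in heuristic order, backing up to the next one on failure."""
    for rule in order_rules(goal, candidates, derivation, mapping):
        result = try_rule(rule, goal)
        if result is not None:
            return rule, result
    return None


# Derivation saved while self-explaining a (hypothetical) example.
derivation = {("tension", "stringA"): "tension_is_force_magnitude"}
mapping = {"stringA": "string1", "blockA": "b1"}
candidates = ["weight_rule", "tension_is_force_magnitude"]

ordered = order_rules(("tension", "string1"), candidates, derivation, mapping)
unguided = order_rules(("tension", "string1"), candidates, {}, mapping)
```

Because the heuristic only reorders candidates, an unexplained example degrades gracefully to the default order, which matches the fallback behavior described in the text.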

LEARNING  NEW  DOMAIN  RULES 

The textbook does not mention all the physics knowledge needed for
explaining the examples and solving the problems. Many previous
simulations of textbook learning (e.g., Cohen, 1990; VanLehn, 1987)
have found the same incompleteness. In order to model this
incompleteness, we first constructed a knowledge base sufficient to
solve all the problems and explain all the examples. We asked two
people not involved in the construction of the rules to judge whether
each rule was mentioned in the textbook. The judges agreed 95% of the
time, and disagreements were settled by a third judge. Of the 62
physics rules, only 29 (47%) were judged to be present in the
textbook. Because some students get most of the problems correct,
either they knew the missing knowledge already or they acquired new
rules as they studied examples and solved problems. There is some
empirical evidence against the prior knowledge account (VanLehn, Jones
& Chi, in press), so Cascade is based on the hypothesis that students
learn the missing rules.

Cascade uses explanation-based learning of correctness (VanLehn, Ball
& Kowalski, 1990) to model domain knowledge acquisition. When Cascade
cannot achieve its present goal with any known domain rule, it uses a
“non-domain” rule. Such rules are distinctively marked in order to
represent the belief that they are part of some other task domain
(e.g., common sense physics) or are incorrect. For instance, one
non-domain rule is, “The property value of a quantity is a function of
the property value of one of its defining parts.” This rule is overly
general and hence incorrect. However, Cascade uses it to prove the
example line, “The incline of the normal force on block-1 by surface-1
is perpendicular to the incline of surface-1.” A new domain rule is
created from this proof by specializing the non-domain rule. In this
case, the new domain rule is, “The incline of a normal force on B by S
is perpendicular to the incline of S.” This rule is correct.
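One way to picture this specialization step is the Python sketch below. It is a loose illustration under strong assumptions: the text does not specify how Cascade generalizes a proof, so the pattern `NON_DOMAIN_RULE`, the matcher `match`, the helpers `substitute`, `variablize` and `learn_domain_rule`, and the choice to replace `block1` and `surface1` by the variables `?B` and `?S` are all hypothetical.

```python
# "?Prop of ?Quantity is ?Fn of ?Prop of ?Part" (overly general on purpose).
NON_DOMAIN_RULE = ("?Fn", ("?Prop", "?Quantity"), ("?Prop", "?Part"))


def match(pattern, term, bindings):
    """One-way match of a pattern with ?variables against a ground term."""
    if isinstance(pattern, str) and pattern.startswith("?"):
        if pattern in bindings:
            return bindings if bindings[pattern] == term else None
        return {**bindings, pattern: term}
    if (isinstance(pattern, tuple) and isinstance(term, tuple)
            and len(pattern) == len(term)):
        for p, t in zip(pattern, term):
            bindings = match(p, t, bindings)
            if bindings is None:
                return None
        return bindings
    return bindings if pattern == term else None


def substitute(pattern, bindings):
    """Instantiate a pattern with the bindings found by the proof."""
    if isinstance(pattern, tuple):
        return tuple(substitute(p, bindings) for p in pattern)
    return bindings.get(pattern, pattern)


def variablize(term, constants_to_vars):
    """Replace example-specific constants by rule variables."""
    if isinstance(term, tuple):
        return tuple(variablize(t, constants_to_vars) for t in term)
    return constants_to_vars.get(term, term)


def learn_domain_rule(line, constants_to_vars):
    """Specialize the non-domain rule to the property, function and kind of
    quantity seen in the explained line, then generalize its constants."""
    b = match(NON_DOMAIN_RULE, line, {})
    if b is None:
        return None
    quantity = b["?Quantity"]
    # The part must be one of the quantity's defining parts.
    if not isinstance(quantity, tuple) or b["?Part"] not in quantity[1:]:
        return None
    return variablize(substitute(NON_DOMAIN_RULE, b), constants_to_vars)


line = ("perpendicular",
        ("incline", ("normal_force", "block1", "surface1")),
        ("incline", "surface1"))
new_rule = learn_domain_rule(line, {"block1": "?B", "surface1": "?S"})
```

The learned rule is narrower than the non-domain rule (it is fixed to inclines of normal forces) but broader than the example line (it holds for any B and S), which is the sense of "specializing" used in the paragraph above.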

Most of the overly general rules required for learning correct domain
rules have the same characteristics as the one just mentioned. They
all move property values from objects to related objects. However,
there are 3 occasions where new types of forces are used without any
introduction or explanation, and these could not be handled with such
property-value manipulations. For instance, one problem asserts that
an object is prevented from sliding down an inclined plane by a spring
that is at the bottom of the plane. This requires inventing a new
object (a force), so manipulating property values will not handle this
case. In order to handle such learning events, we gave Cascade
non-domain rules for common sense physics, including a rule that says
that compressed springs push back. We also gave it overly general
rules that convert common sense physics to proper physics. One such
rule is, “If object X pushes on object Y, then there is a force on Y
due to X.”

These force-invention cases demonstrate that the success of
explanation-based learning of correctness is not a foregone
conclusion. It may seem like a learning method that cannot fail, as
the only constraint on the content of the overly general rules is the
inventiveness of the rules’ writer. However, because we could not
handle the force-invention cases by just inventing overly general
rules, we had to adopt a different approach, which is based on the
intuitively plausible hypothesis (espoused by diSessa, 1983, among
others) that formal physics is acquired by refining and adjusting
one’s naive physics.

Because Cascade applies explanation-based learning of correctness
(EBLC) as soon as it fails to achieve a goal, it is deadly for Cascade
to stray from a solution path, for it has no way to tell whether the
failure at the end of a dead end is caused by missing domain knowledge
or by having made a wrong search control decision earlier. In the
latter case, EBLC will still be tried and may learn an incorrect rule.
Avoiding failure paths is easier when explaining an example, for the
goals mention values as well as sought quantities, and the form of the
value often eliminates some of the false choices. We found that during
problem solving, when goals usually mention only sought quantities and
not their values, some other form of search control was necessary in
order to prevent EBLC from learning incorrect rules. We were surprised
(and delighted) to find that analogical search control provided enough
constraint to allow EBLC to learn only correct rules during problem
solving.


EBLC is similar to other knowledge-level learning methods whose
non-domain rules take the form of explanation patterns (Schank, 1986),
causal attribution rules (Pazzani, Dyer & Flowers, 1986; Lewis, 1988)
and determinations (Widmar, 1989). EBLC uses the same format for both
domain and non-domain rules, which should simplify acquisition of
non-domain rules via syntactic generalization. Also, EBLC operates
during both problem solving and example explaining, whereas most of
its predecessors operate only while explaining examples, and thus did
not need the strong search control that analogical search control
provides. The discovery of the interaction between search control
learning and this class of knowledge-level learning may be the most
important technical contribution of the Cascade research.

TAKING THE EXAMPLE’S WORD FOR IT

One of the original 62 domain rules could not be learned via EBLC from
overly general rules nor from common sense physics. The rule applies
first in an example and then in a series of problems, all of which
have three strings converging on a knot, with various forces pulling
on each of the strings. The rule simply says that the knot is the body
(which means that forces will be resolved on it). The text leads
students to believe that bodies are physical objects with mass, but
the strings are massless and so presumably are the knots. All the
subjects that self-explained the example line stating that the knot
was the body found the line confusing. After searching in vain for an
explanation, they all wound up “taking the example’s word for it.”
That is, they accepted the line as correct even though they could not
derive it themselves. On subsequent three-string problems, they would
again make the assumption that the knot is the body, often referring
to the example by name.

To model this behavior, we tried syntactic techniques for learning by
completing explanations (e.g., VanLehn, 1987; Hall, 1988), but could
not get the rule learned during the example to apply to all the
problems. We relaxed transformational analogy enough to allow it to
transfer the knowledge from examples to problems, but this made it too
liberal, and it often transferred incorrect knowledge. We solved the
problem with an ad hoc but interesting method. When the example
explainer has to “take the example’s word for it,” it builds a special
rule that says, “If the goal is to find a body, try a transformational
analogy to the knot of example sx,” where example sx is the one with
the three strings in it. Thus, Cascade learns which kinds of
situations are good ones for applying transformational analogy.

EVALUATION 

The mechanisms of Cascade were constructed to simulate behavior
directly visible in the protocols and to model learning inferred from
the protocols. It is not clear whether these mechanisms will actually
reproduce the self-explanation effect found by Chi et al. Although the
effect consists of correlations between five measures (VanLehn, Jones
& Chi, in press), we will here discuss only the most important
correlation: Students who utter more self-explanatory statements while
studying examples also answer more problems correctly.

Our hypothesis is that the correlation is caused by a strategy
difference rather than a prior knowledge difference. Thus, we ran
Cascade twice and gave it the same prior knowledge both times (the 29
rules judged to be mentioned in the text). On the so-called Good
Student run, we had Cascade self-explain every line of all 3 examples
from the Chi et al. study. On the Poor Student run, we had it explain
no lines. As expected, this strategy difference during example
studying caused a large difference during problem solving. The Good
Student solved all 23 of the Chi et al. problems correctly, while the
Poor Student solved only 9 problems correctly. Because Cascade is
deterministic, explaining an intermediate number of example lines
would have caused an intermediate score on the problem solving phase.
Thus, Cascade does reproduce the observed correlation.

According to Cascade, there are several sources for this correlation.
First, when more lines are explained, there is more opportunity for
Cascade to stumble across gaps in its domain knowledge. These gaps
trigger EBLC, which fills them in with new domain rules. The Good
Student learned 7 rules during example studying. Of course, the Poor
Student learned none, as it didn’t even process the examples.

The Good Student learned 15 rules during problem solving. Of these, 3
were also learned by the Poor Student. In order to determine why the
Good Student learned 12 more rules than the Poor Student, we ran
Cascade on the problems with analogical search control turned off. It
was given the 36 Good Student rules (29 rules from initial knowledge
and the 7 rules learned during example studying). It learned 3 of the
12 rules that were learned by the Good Student and not the Poor
Student. This shows that the knowledge acquired during example
studying set up contexts during problem solving that enabled further
rule acquisition. In order to control search enough to learn the
remaining 9 rules, analogical search control was necessary. To put it
another way, of the 22 rules learned by the Good Student, 3 can be
learned during problem solving even if the examples are not
self-explained, 10 can be learned by self-explaining the examples, and
9 require both self-explanation and analogical search control.
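The rule counts in this paragraph can be cross-checked with a few lines of arithmetic. The Python below only restates the numbers reported above; the variable names are ours.

```python
initial_rules = 29                  # rules judged to be in the textbook
learned_studying_examples = 7       # Good Student, example studying
learned_solving_problems = 15       # Good Student, problem solving
shared_with_poor_student = 3        # also learned by the Poor Student
learned_without_search_control = 3  # Good-only rules learned with it turned off

# Rules the Good Student learned during problem solving that the Poor Student did not.
good_only = learned_solving_problems - shared_with_poor_student
# Good-only rules that were not learned once analogical search control was off.
need_search_control = good_only - learned_without_search_control
# Rules attributable to self-explanation alone.
via_self_explanation = learned_studying_examples + learned_without_search_control
total_learned = learned_studying_examples + learned_solving_problems

print(initial_rules + learned_studying_examples,
      good_only, need_search_control, via_self_explanation, total_learned)
```

The three categories (3, 10 and 9) sum to the 22 rules the Good Student learned, so the breakdown in the text is internally consistent.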

DISCUSSION 

The research so far has shown that Cascade is qualitatively similar to
human behavior in that it can solve problems, both analogically and
non-analogically, and it can explain examples. Moreover, both major
strategies for studying examples are modelled, as well as several
types of analogical reference. The research has also shown that
Cascade reproduces the general correlations observed in aggregate
data, especially the most important correlation, which indicates that
amount of learning is related to amount of self-explanation.

The next step in the research is to try to fit the protocols of
individual subjects. Given a student’s protocol, Cascade will be
forced to explain exactly those lines that the student explained. Will
this cause Cascade to make the same errors as the subject? If the
subject makes errors that Cascade does not and Cascade uses a rule
that it learned via EBLC, then EBLC may be too powerful. If the
student makes an error that Cascade does not and Cascade uses only the
29 prior knowledge rules, then our assumption that students all have
the same prior knowledge is inaccurate. If Cascade makes errors that
the subject does not, then the subject may have used some kind of
learning that Cascade does not model. Thus, this further work should
test our hypothesis that the self-explanation effect is caused by a
strategy difference and not by a prior knowledge difference, and it
should help determine whether Cascade’s learning models are
appropriate and complete.

Although it would be premature to draw strong conclusions about the
psychological adequacy of Cascade from the present research, the work
taught us much about the technical difficulties of integrating several
types of learning and problem solving. EBLC looked like it could not
fail to learn, as we could write any overly general rule we liked.
Moreover, it seemed certain that it would reproduce the
self-explanation effect, because only the self-explaining students
processed the examples and we thought all learning occurred there.
However, we found it impossible to get EBLC to learn with just overly
general rules; we had to embrace diSessa’s hypothesis that formal
physics evolves from naive physics. We also found that most rules are
not learned during example explaining, but during problem solving. So
why does explaining the examples help? It turns out the examples help
because they control the search (via analogical search control) during
problem solving. Without such search control, Cascade wanders down
false paths and EBLC learns incorrect rules. In short, we learned that
the self-explanation effect is not due solely to knowledge-level
learning, but that search control learning is necessary in order to
properly bias knowledge-level learning. This may be a general
principle that applies to all EBLC-like learning methods (e.g., Lewis,
1988; Pazzani, Dyer & Flowers, 1986; Schank, 1986; Widmar, 1989) and
perhaps even to all impasse- or failure-driven knowledge-level
learning methods.